Data-Driven Sub-Units and Modeling Structure for Continuous Sign Language Recognition with Multiple Cues

نویسندگان

  • Vassilis Pitsikalis
  • Stavros Theodorakis
  • Petros Maragos
چکیده

We investigate the automatic phonetic modeling of sign language based on phonetic sub-units, which are data driven and without any prior phonetic information. Visual processing is based on a probabilistic skin color model and a framewise geodesic active contour segmentation; occlusions are handled by a forward-backward prediction component leading finally to simple and effective region-based visual features. For sign-language modeling we propose a modeling structure for data-driven sub-unit construction. This utilizes the cue that is considered crucial to segment the signal into parts; at the same time we also classify the segments by implicitly assigning labels of Dynamic or Static type. This segmentation and classification step disentangles Dynamic from Static parts and allows us to employ for each type of segment the appropriate cue, modeling and clustering approach. The constructed Dynamic segments are exploited at the model level via hidden Markov models (HMMs). The Static segments are exploited via k-means clustering. Each Dynamic or Static part, exploits the appropriate cue related to the movement. We propose that the movement cues are normalized in order to be translation and scale invariant. We apply the proposed modeling for further combination of the movement trajectory individual cues. The proposed approaches are evaluated in recognition experiments conducted on the continuous sign language corpus of Boston University (BU-400) showing promising preliminary results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Advances in Dynamic-Static Integration of Movement and Handshape Cues for Sign Language Recognition

We explore the integration of movement-position (MP) and handshape (HS) cues for sign language recognition. The proposed method combines the data-driven subunit (SU) modeling exploiting the dynamicstatic notion for MP and the affine shape-appearance SUs for HS configurations. These aspects lead to the new dynamic-static integration of manual cues. This data-driven scheme takes advantage of the ...

متن کامل

Hand Tracking and Affine Shape-Appearance Handshape Sub-units in Continuous Sign Language Recognition

We propose and investigate a framework that utilizes novel aspects concerning probabilistic and morphological visual processing for the segmentation, tracking and handshape modeling of the hands, which is used as front-end for sign language video analysis. Our ultimate goal is to explore the automatic Handshape Sub-Unit (HSU) construction and moreover the exploitation of the overall system in a...

متن کامل

Dynamic-static unsupervised sequentiality, statistical subunits and lexicon for sign language recognition

a r t i c l e i n f o We introduce a new computational phonetic modeling framework for sign language (SL) recognition. This is based on dynamic–static statistical subunits and provides sequentiality in an unsupervised manner, without prior linguistic information. Subunit " sequentiality " refers to the decomposition of signs into two types of parts, varying and non-varying, that are sequentiall...

متن کامل

The Factor Structure of a Written English Proficiency Test: A Structural Equation Modeling Approach

The present study examined the factor structure of the University of Tehran English Proficiency Test (UTEPT) that aims to examine test takers’ knowledge of grammar, vocabulary, and reading comprehension. A Structural Equation Modelling (SEM) approach was used to analyse the responses of participants (N= 850) to a 2010 version of the test.  A higher-order model was postulated to test if the unde...

متن کامل

A parallel multi-stream model for sign language recognition

In this paper, the sub-units in each stream are used and embedded in the multi-stream model. In this framework, sign language recognition system was implemented and evaluated. Experiments were carried out for 5177 Chinese signs. The real time isolated recognition rate is 95.1%. For continuous sign recognition, the word correct rate is 91.8%. This has shown that parallel multi-stream model is po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010